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Abstract — A novel solution is proposed to achieve a frequent 
task in wireless networks, where all node broadcast information 
to and receive information from nodes within a single hop. The 
solution exploits the multiaccess nature of the wireless medium 
and addresses the half-duplex constraint at the fundamental 
level. The defining feature of the scheme is to let all nodes send 
their messages at the same time, where each node broadcasts a 
codeword (selected from its unique codebook) consisting of on- 
slots and off -slots. A node transmits only during its on-slots, and 
listens to its peers through its own off-slots. Each node decodes 
the messages of its peers based on the superposed signals received 
through its own off-slots. Decoding can be viewed as a problem 
of sparse support recovery based on linear measurements. In the 
case that each message consists of a small number of bits, an 
iterative message-passing algorithm based on belief propagation 
is developed. In a network consisting of Poisson distributed nodes, 
numerical results demonstrate that the proposed scheme achieves 
several times the rate of slotted ALOHA and CSMA with the 
same packet error rate (1%). 

I. Introduction 

Consider a frequent situation in wireless peer-to-peer net- 
works, where every node wishes to broadcast messages to all 
nodes within its one-hop neighborhood, called its peers, and 
also wishes to receive messages from its peers. We refer to this 
problem as mutual broadcast. Such traffic can be dominant in 
many applications, such as messaging or video conferencing 
of multiple parties in a spontaneous social network, or on an 
incident scene or a battlefield. Wireless mutual broadcast is 
also critical to efficient network resource allocation, where 
messages are exchanged between nodes about their demands 
and local states, such as queue length, channel quality, code 
and modulation format, and request for certain resources and 
services. 

A major challenge in wireless networks is the half-duplex 
constraint, namely, currently affordable radio cannot receive 
useful signals at the same time over the same frequency band 
over which it is transmitting. This is largely due to the limited 
dynamic range and noise of the radio frequency circuits, which 
are likely to remain a physical restriction in the near future. 
An important consequence of the half-duplex constraint in a 
typical implementation of wireless networks is that, if two 
peers transmit their packets at the same time, they do not hear 
each other. To achieve reliable mutual broadcast using a usual 
packet-based scheme, nodes have to repeat their packets many 
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times interleaved with random delays, so that all peers can hear 
each other after enough retransmissions. This is basically the 
ubiquitous random channel access solution. 

A closer examination of the half-duplex constraint, however, 
reveals that a node does not need to transmit an entire packet 
before listening to the channel. An alternative solution is 
conceivable: Let a frame (typically of a few thousand symbols) 
be divided into some number of slots, where each node 
transmits over a subset of the slots and assumes silence over 
the remaining slots, then the node can receive useful signals 
over those nontransmission slots. If nodes activate different 
sets of on-slots, then nodes can all transmit information during 
a frame and receive useful signals within the same frame, 
and decode messages from peers as long as sufficiently strong 
error-control codes are applied. This on-off signaling, called 
rapid on-ojf-division duplex (RODD), was originally proposed 
in Q. Using RODD, reliable mutual broadcast can be achieved 
using a single frame interval. 

Importantly, RODD achieves virtual full-duplex communi- 
cation using half-duplex radios. Despite the half-duplex phys- 
ical layer, the radio appears to be full-duplex in higher layers. 
Not only is RODD signaling applicable to the mutual broadcast 
problem, it can also be the basis of a clean-slate design of the 
physical and medium access control (MAC) layers of wireless 
peer-to-peer networks. RODD-based schemes have advantages 
over state-of-the-art designs of MAC protocols, which either 
apply ALOHA-type random access or use a mixture of random 
access and scheduling/reservation. This is in part because 
RODD eliminates retransmissions due to collisions in random 
access schemes. 

In order to decode messages from neighbors, it is necessary 
to acquire their timing (or relative delay) regardless of whether 
RODD or any other physical- and MAC-layer technology is 
used. Timing acquisition and decoding are generally easier 
if the frames arriving at a receiver are synchronous locally 
within each neighborhood, although synchronization is not 
a necessity. In a wireless network, synchronization can be 
achieved using various distributed algorithms for reaching 
consensus j3)-||5] or using a common source of timing, such as 
the Global Positioning System (GPS). Whether synchronizing 
the nodes is worthwhile is a challenging question, which is 
not discussed further in this paper. 

In this paper, we focus on a special use of RODD signaling 
and a special case of mutual broadcast, where each node has 
a small number of bits to send to its peers. It is assumed that 
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node transmissions are perfectly synchronized. The goal here 
is to provide a practical algorithm for encoding and decoding 
the short messages to achieve reliable and efficient mutual 
broadcast. Decoding is in fact a problem of support recovery 
based on linear measurements, since the received signal is 
basically a noisy superposition of peers' codewords selected 
from their respective codebooks. There are many algorithms 
developed in the compressed sensing (or sparse recovery) 
literature to solve the problem, the complexity of which is 
often polynomial in the size of the codebook (see, e.g., (6)- 
[10 1). In this paper, an iterative message-passing algorithm 
based on belief propagation (BP) with linear complexity is 
developed. Numerical results show that the proposed RODD 
scheme significantly outperforms slotted-ALOHA with multi- 
packet reception capability and CSMA in terms of data rate. 

The excellent performance of the proposed scheme is be- 
cause it departs from the usual solution where a highly reliable, 
highly redundant, capacity-achieving, point-to-point physical- 
layer code is paired with a rather unreliable MAC layer. 
By treating the physical and MAC layers as a whole, the 
proposed scheme achieves better overall reliability at much 
higher efficiency. 

The remainder of the paper is organized as follows. After 
the system model is presented in Section [TTJ Section |lll] stud- 
ies the conventional random-access schemes, namely slotted- 
ALOHA with multi-packet reception capability and CSMA. 



In Section IV the proposed coding scheme for mutual broad- 



cast is described. The message-passing decoding algorithm is 
developed in Section [V] Numerical comparisons are presented 
in Section [VT] Section |VII| concludes the paper. 



II. Channel and Network Models 

A. Linear Channel Model 

Let $ = {Xi]i denote the set of nodes on the plane. We 
refer to a node by its location Xi. Suppose all transmissions 
use the same single carrier frequency. Let time be slotted and 
all nodes be perfectly synchronizedPlLet Wi £ {1, . . . , 2 1 } de- 
note the message node X; wishes to broadcast. In discrete-time 
baseband, let Sj(u7j) denote the signature (codeword) trans- 
mitted by node Xi, whose entries take values in {—1, 0, +1}. 
Let Uoi denote the complex-valued coefficient of the wireless 
link from Xi to Xq. The signal received by node Xq, if it 
could listen over the entire frame, is then described by 

x,e<s>\{x } 



U 0i Si(zji) + W 



(1) 



where W is noise consisting of independent identically dis- 
tributed (i.i.d.) circularly symmetric complex Gaussian entries 
with zero mean and unit variance, and 7 denotes the nominal 
signal-to-noise ratio (SNR). Denote the set of neighbors of 
Xq by Af(Xo). For simplicity, if we further assume that 
transmissions from non-neighbors, if any, are accounted for 
as part of the additive Gaussian noise, ([T| can be rewritten as 

Y = Vl E UoiSiiwij+W (2) 

'See jij for a discussion of synchronization issues. In 1 1 1) , cyclic codes 
are proposed to resolve the user delays in a multiaccess channel. 



where each element in W is assumed to be circularly sym- 
metric complex Gaussian with variance a 1 . The variance 
accounts for interference from non-neighbors and depends on 



the network topology. It will be derived in Section IV 



B. Network Model 

Consider a network with nodes distributed across the plane 
according to a homogeneous Poisson point process (p.p. p.) 
with intensity A. The number of nodes in any region of area 
A is a Poisson random variable with mean XA. Without loss 
of generality, we assume node X is located at the origin and 
focus on its performance, which should be representative of 
any node in the network. 

Poisson point process is the most frequently used model to 
study wireless networks (see [ 12] and references therein). The 
RODD signaling and the mutual broadcast scheme proposed 
in this paper are of course not limited to homogeneous 
Poisson distributed networks. The homogeneous p.p.p. model 
is assumed in this paper to facilitate analysis and comparison 
of RODD and competing technologies. 

C. Propagation Model and Neighborhood 

It is assumed that the large-scale signal attenuation over 
distance follows power law with some path-loss exponent 
a > 2, and the small-scale fading of a link is modeled by 
a Rayleigh random variable with mean equal to 1. There 
are different ways to define the neighborhood of a node. For 
concreteness, we say that nodes Xi and Xj are neighbors of 
each other if the channel gain between them is no less than a 
certain threshold, denoted by 6. Link reciprocity is regarded 
as given. 

For any pair of nodes Xj, Xj £ $, let Rij = |Xj — Xj\ and 
Gij denote the distance and the small-fading gain between 
them in a given frame, respectively. Then the channel gain 
from Xj to Xi is GijR^ a . The neighborhood of a node 
depends on the instantaneous fading gains. Specifically, we 
denote the set of neighbors of node Xi as 



Af(X i ) = {X j £$:G ij Rr j a >9,j^i} 
The channel coefficient Uij should satisfy \Uij 



(3) 



G ij R ij a > 

where its phase is assumed to be uniformly distributed on 
[0, 2tt) independent of everything else. Assuming the Poisson 
point process network model introduced in Section |II-B| the 
distribution of the amplitude of channel coefficient Uoi in ([T| 
for an arbitrary neighbor Xj £ Af(Xo) is derived in the 
following. 

Without loss of generality, we drop the indices and i, 
and use R and G to denote the distance and the fading gain, 
respectively. Since the two nodes are assumed to be neighbors, 
G and R satisfy GR~ a > 9, i.e., R < {G/6) 1/a . Under 
the assumption that all nodes form a p.p.p., for given G, 
this arbitrary neighbor Xj is uniformly distributed in a disc 
centered at node Xo with radius (G/0) 1 . Therefore, the 
conditional distribution of R given G can be expressed as 

P(R < r\G) = mm\ 1, . , 



(4) 
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Now for every u > y/0, by (Hh we have 



P(GR- a > u 2 ) = E G I P \R < 




(5) 



U c 



Hence the probability density function (pdf) of | Uoi | of neigh- 
bor X, is 



p{u) 



4 2 /° 



0, otherwise. 



(6) 



In fact, coefficient vector Qi = (Gji)j for all j ^ i can be 
regarded as a mark of node Xj, so that $ = {(Xj,0j)}j is a 
marked p.p.p. Denote 



$ = $\(X ,So) 



(7) 



given that (Xo, Go) is at the origin. By the Slivnyak-Meche 
theorem [121, $ is also a marked p.p.p. with intensity A. By 
the Campbell's theorem fl2) , the average number of neighbors 
of Xo can be obtained as: 



= eJ ]T i(G 0i R^>e) 

1(^,^)6* 
j*OQ poo 

= 2ir\ 1 (gr~ a > 9) re~ 9 drdg 

Jo Jo 

= \\e- 2 / a T ( -) (8) 
a \a I 



where l(-) is the indicator function and T( ) is the Gamma 
function. 



III. Random-Access Schemes 

In this section we describe two random access schemes, 
namely slotted ALOHA and CSMA, and provide lower bounds 
on the error probability for a given number of symbol trans- 

to compare 



missions. The results will be used in Section VI 



with the performance of our proposed sparse recovery scheme. 

Let L denote the total number of bits encoded into a frame, 
which includes an /-bit message and a few additional bits 
which identify the sender. This is in contrast to broadcast via 
sparse recovery, where the signature itself identifies the sender 
(and carries the message). Each broadcast period consists of 
a number of frames to allow for retransmissions. Without 
loss of generality, we still consider the typical node Xo at 
the origin. An error event is defined as that node Xo cannot 
correctly recover the message from one specific neighbor. The 
corresponding error probabilities achieved by slotted ALOHA 
and CSMA are denote by Pj; and P[j, respectively. 



A. Slotted ALOHA 

In slotted ALOHA, suppose each node chooses indepen- 
dently with the same probability p to transmit in every frame 
interval. A message is assumed to be decoded correctly if 
the signal-to-interference-plus-noise ratio (SINR) in the cor- 
responding frame transmission is no smaller than a threshold 
5 (multi-packet reception is possible only if 5 < 1). Over the 
additive white noise channel with SINR 5, in order to send L 
bits reliably through the channel, the number of symbols in a 
frame must exceed 

log 2 (1 + 6) ■ (9) 

Let X denote one specific neighbor of node Xo and G 
denote the fading coefficient between them. Suppose the mark 
of X is denoted by Q. Given that (X.Q) G $ where $ 
is given by |7}, denote $i = $\{(X, Q)}, which is also 
a marked p.p.p. with intensity A. For a given realization 
of (X, G) and $i, define P\(X, G, $i) as the probability 
that the received SINR from X to Xq is no less than the 
threshold 5 conditioning on that X transmits in a given frame. 
In any given frame, the probability of the event that X 
transmits, Xo listens, and the transmission is successful is 
thus p(l -p)Pl(X, G, $i). Therefore, the probability that the 
message from X has not been successfully received by Xo 
after Mj consecutive frame intervals can be expressed as 

l-p{l-p)Pl{X,GM)] Mt \ (10) 



where the expectation is over the joint distribution (X, G, <l>i). 
Due to the convexity of function (max{0, 1 — z}) n , z > 
0, n € {1, 2, . . . }, PI in ( fT0| ) can be lower bounded as 

Pl>(m & K{0,l-p(l-p)E{Pl(X,G^ 1 )}]) Mf . (11) 

In Appendix [a] the expectation of P|(X, G, $i) is calcu- 
lated using the known Laplace transform of the distribution 
of the interference p2) . For a given number of symbol 
transmissions M a , the lower bound on Pj; is presented in the 
following result. 

Proposition 1: Consider an arbitrary neighbor X of node 
Xo. The probability that Xo cannot successfully receive the 
message from X after M a symbol transmissions is lower 
bounded as follows: 

P: > ^max jo, 1 - - p) (jj sin T(l - b) 



exp { -Xp ,"" (uj) b - l 
sm(07r) 7 



(12) 



where t = b = 2/a and n a — M a log 2 (1 + S)/L. 

Although ( fT2| i appears to be complicated, computing it 
only involves a straightforward single-variable integral (the 
outcome of the integral is in fact real-valued). 

In the slotted ALOHA scheme, despite repeated transmis- 
sions, a given link may still fail to deliver the message due 
to the half-duplex constraint (the receiver happens to transmit 
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during the same frame) and consistently weak received SINR 
due to random interference from other links. 



B. CSMA 

As an improvement over ALOHA, CSMA lets nodes use 
a brief contention period to negotiate a schedule in such 
a way that nodes in a small neighborhood do not transmit 
data simultaneously. We analyze the performance of CSMA 
by using the Matern hard core model fTZ) . To be specific, 
consider the following generic scheme: Each node senses the 
channel continuously; if the channel is busy, the node remains 
silent and disables its timer; as soon as the channel becomes 
available, the node starts its timer with a random offset, and 
waits till the timer expires to transmit its frame. Clearly, the 
node whose timer expires first in its neighborhood captures 
the channel and transmits its frame. 

Mathematically, let {T^} be i.i.d. random variables with 
uniform distribution on [0,1], which represent the timer offsets 
for all nodes {Xi} in respectively. By viewing T{ as a mark 
of node Xi we redefine $ = {(Xi,Qi,Ti)}i, which is still a 
marked p.p.p. with intensity A. The medium access indicators 
{e.i}i are additional dependent marks of the nodes in $ defined 
as follows: 



E i =l(T j >T l , VX,- e Af{Xi)) . 



(13) 



The probability of Tj = Tj for j ^ i is zero. Node Xi will 
transmit its frame if and only if Ei = 1. 

The same as in the slotted ALOHA case, a message is 
assumed to be decoded correctly if SINR in the corresponding 
frame transmission is no smaller than S. It follows that the 
number of symbols in a frame must exceed d9). 

Let X be one specif ic neighbor of Xq. Define G and 
Q as in Section 



m-A 



Given that (X,Q) G $, denote 
= $>\{(X, Q)\. For a given realization of (X, G) and i>i, 
define Pl(X,G,&i) as the probability that node X transmits 
its frame and the received SINR from X is no less than the 
threshold 5. Therefore, the probability that the message from 
X has not been successfully received after Mf consecutive 
frame intervals can be expressed as 



i-p:(x,g,$i) 



Mf 



> (maxjo.l-Efp^^G,^)}} 



(14) 



where the expectation is over the joint distribution of 
(X, G, $i), and ([14} is due to the convexity of function 
(max{0, 1 - z}) n , z > 0, n = 0, 1, . . . . 

For any given number of symbol transmissions M c , the 
lower bound on error probability PI is given by the following 
result, which is proved in Appendix [B] 

Proposition 2: Consider an arbitrary neighbor X of node 
Xq. The probability that Xq cannot successfully receive the 
message from X after M c symbol transmissions is lower 
bounded as follows: 



> max ( 0, 1 - 




(15) 



where c is defined in ([H) and n c = M c log 2 (1 + S)/L. 

In contrast to slotted ALOHA, frame loss due to the half- 
duplex constraint is eliminated through contention. However, 
a given link may still fail to deliver the message after repeated 
transmissions because the received SINR is constantly weak 
due to random interference outside the neighborhood. 

IV. Encoding for Mutual Broadcast 

In contrast to random-access schemes, where many retrans- 
missions are needed to achieve a desired error performance, 
we next describe a unique signaling that allow all message 
exchanges to finish within one (longer) frame of transmission. 
The key idea is that each node broadcasts a codeword consist- 
ing of on-slots and off-slots. A node transmits only during its 
on-slots, and listens to its peers through its own off-slots. 

Suppose each node Xi is assigned a unique codebook of 
2 l on-off signatures (codewords) of length M s , denoted by 
{S'i(l), . . . , Si(2 1 )}. The optimal design of the signatures is 
out of the scope of this paper. For simplicity, let each element 
of each signature be generated randomly and independently, 
which is with probability 1— q and 1 and —1 with probability 
q/2 each. Node Xi broadcasts its /-bit message (or information 
index) zui € {1, . . . , 2 1 } by transmitting the codeword Si(wi). 
As discussed in Sections [I] and [IIJ it is reasonable to assume 
that node transmissions are synchronized. 

In each symbol slot, those transmitting nodes in <t form 
an independent thinning of $ with retention probability q, 
denoted by & q . $ g is still an independent marked p.p.p. but 
with intensity Xq. Thus, the sum power from all transmitting 
non-neighbors of node Xq in each time slot is derived as 



jGoiR 0i a l (G ii? 0i Q < 




-8r a 



3 drd 9 



dr 



TT\qj9 1 ~ 2 / a r 

a(a — 2) 

Therefore, the variance of each element of W in (|2) is 

nXq'yd 1 - 2/ot r 



a 2 = 



a(a 



a 



1. 



(16) 



(17) 



The signal received by the typical node Xq, if it could 
listen over the entire frame, is described by Q. Suppose 
|A/"(Xo)| = K and the neighbors of Xq are indexed by 
1, 2, . . . , K. The total number of signatures of all neighbors is 
N = 2 l K. Due to the half-duplex constraint, however, node 
Xq can only listen during its off-slots, the number of which 
has binomial distribution, denoted by M ~ B(M S: 1 — q), 
whose expected value is E {M} = M s {\ — q). Let the matrix 



S e 



pMxN 



consist of columns of the signatures from all 



neighbors of node Xq, observable during the M off-slots of 
node Xq, and then normalized by yjM s q{\ — q) so that the 
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expected value of the I2 norm of each column in S is equal to 
1. Based on Q, the M-vector observed through all off-slots 
of node X can be expressed as 



where 



Y = JT S SX + w 



7s = jM a q(l - q)/a 2 



(18) 



(19) 



and X is an TV-vector indicating which K signatures are 
selected to form the sum in Q as well as the signal strength 
for each neighbor. Precisely, X^_ 1 ^ 2 i+i = Ujl(wj = i) for 
1 < 3 < K and 1 < i < 2 l . For example, consider 
K = 3 neighbors with I = 2 bits of information each, where 
101 = 3,W2 = 2,u>3 = 1, then vector X is expressed as 



X = [0 Z7i u 2 U 3 0]. 



(20) 



The sparsity of X is exactly 2 , which is very small for large 
I. The average system load is defined as /3 — E{A^} /(M s (l — 
q))=2 l c/(M s (l-q)). 

In general, the decoding problem node Xq faces is to 
identify, out of a total of N = 2 l K signatures from all its 
neighbors, which K signatures were selected. This requires 
every node to know the codebooks of all neighbors. One 
solution is to let the codebook of each node be generated 
using a pseudo-random number generator using its network 
interface address (NIA) as the seed, so that it suffices to 
acquire all neighbors' NIAs. This, in turn, is a neighbor 
discovery problem, which has been studied in p3)-p3). The 
discovery scheme proposed in fl4) , ]f5) uses similar on-off 
signalling and also solves a compressed sensing problem. 

V. Sparse Recovery (Decoding) via Message 
Passing 

The problem of recovering the support of the sparse input 
X based on the observation Y has been intensively studied 
in the compressed sensing literature. In this section, we 
develop an iterative message-passing algorithm based on belief 
propagation. The reasons for the choice include: 1) It is one 
of the most competitive decoding schemes in terms of error 
performance; and 2) the complexity is only linear in the vector 
to be estimated. 



A. The Factor Graph 

Belief propagation belongs to a general class of message- 
passing algorithms for statistical inference on graphical mod- 
els, which has demonstrated empirical success in many ap- 
plications including error-control codes, neural networks, and 
multiuser detection in code-division multiple access (CDMA) 
systems. 

In order to apply BP to the coded mutual broadcast problem, 
we construct a Forney-style bipartite factor graph to represent 
the model (TTJJ. Here, we separate the real and imaginary parts 
in ( p~8] > as 

= jY s SX^ + = yfuSXW + W W 

(21) 



where the superscripts (1) and (2) represent the real and 
imaginary parts respectively, W 



W 4 - 



1, 2 consists of i.i.d. 



Gaussian random variables with zero mean and variance 1/2. 
The message passing algorithm we shall develop based on pTj ) 
is not optimal, but such separation facilitates approximation 



and computation, which will be discussed in Section V-B 



Since two parts in ( pi) share the same factor graph, we treat 
one of them and omit the superscripts: 



N 

k=l 



(22) 



where fi 6 1, 2, . . . , M and k € 1,2, ... ,N index the 
measurements and the input "symbols," respectively. For 
simplicity, we ignore the dependence of the symbols {X^} 
for now, which shall be addressed toward the end of this 
section. Each Xk then corresponds to a symbol node and 
each corresponds to a measurement node, where the joint 
distribution of all {Xk} and {Y^} are decomposed into a 
product of M + N factors, one corresponding to each node. 
For every (fi, fc), symbol node k and measurement node fi 
are connected by an edge if s M fe 7^ 0. A simple example is 
shown in Fig. [T] for 5 measurements and 3 neighbors each 
with 4 messages, i.e., M = 5, K = 3, and N = 3 x 4 = 12. 
The actual message chosen by each neighbor for broadcast is 
marked by a dark node, i.e., W\ — 3,w 2 — 2, w 3 = 1. 




%1 I -Xz| X3\ A4 %6 *7 Ag Xg I Xio\ Xll Xj2 

Fig. 1. The Fomey-style factor graph of coded mutual broadcast. 



B. The Message-Passing Algorithm 

In general, an iterative message-passing algorithm involves 
two steps in each iteration, where a message (or belief, which 
shall be distinguished from an information message) is first 
sent from each symbol node to every measurement node it is 
connected to, and then a new set of messages are computed 
and sent in the reverse direction, and so forth. The algorithm 
performs exact inference within finite number of iterations if 
there are no loops in the graph (the graph becomes a tree if 
it remains connected), while it provides in general a good 
approximation for loopy graphs as the one in the current 
problem. 

For convenience, let dfi (resp. dk) denote the subset of 
symbol nodes (resp. measurement nodes) connected directly 
to measurement node ijl (resp. symbol node k), called its 
neighborhood. ^ \ d/j, | (resp. \dk\) represents the cardinality of 
the neighborhood of measurement node /1 (resp. symbol node 
k). Also, let dfj\k denote the neighborhood of measurement 

2 This is to be distinguished from the notion of neighborhood in the wireless 
network defined in Section III-CI 
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node /i excluding symbol node k and let dk\/j, be similarly 
defined. 

The message-passing algorithm given as Algorithm [T] de- 
codes the information indexes iui, . . . , wk, and is ready for 
implementation. The key steps are described as follows. The 
superscripts i = 1,2 in Algorithm [T] represent the real and 



imaginary parts, respectively. Here, E < X 



varlX 



and 



Y = y;a 2 j represents the conditional mean and 

variance of the input given the Gaussian channel output Y = 
X + W with W ~ 7V(0, a 2 ) is equal to y. Mathematically, 
assume X has cumulative distribution function Px{x), then 
for i = 1,2,..., 



EiX* 



and 



var<^ X 



Y = y;a< 



Y = y;a< 



I- 



-dPx(x) 



^dP x (x) 

Y = y;a- 

Y = y;a 2 



(23) 



(24) 



where J -dPx(x) in ( |23] l denotes the Riemann-Stieltjes inte- 
gral. 

Algorithm 1 Message-Passing Decoding Algorithm 



Input: S,Y,-f s ,M s ,q. 
Initialization: 

vlJi^/lls^k) for all s Mfe ^ and i = 1, 2. 



Initialize r° : * to a large positive number for i = 1, 2. 
Mfl/n iterations: 
for t = 1 to T - 1 do 

for all /i, fc with s^fe 7^ and i = 1, 2 do 

E„ 



end for 



var <; 

1 



t-1, 



Y 



\dk\~\ 

E„f=3fc 



|afe|-i 



t-i, 

|9fc|-l 



|0fc|-l 



T — 1 



|9fc|-l ' |9fc|-l 



for all k, i — 



argmax J=lj ... i2i |m (fe _ 1)2 , +j 



for i = 1,2 
13: end for 

14: m\ <- E X 
1,2. 

15: Output: Wk 



In the following, we derive Algorithm [T] starting from ([22]) 
which is valid for both real and imaginary parts in ( |2"Tj i. It is 
a simplification of the original iterative BP algorithm, which 
iteratively computes the marginal a posteriori distribution 
of all symbols given the measurements, assuming that the 
graph is free of cycles. For each k £ dp, (hence fj, 6 dk), 
let {Vfr represent the message from symbol node k 



to measurement node p, at the t-th iteration and {U^ k (x)} 
represent the message in the reverse direction. Each message 
is basically the belief (in terms of a probability density (or 
mass) function) the algorithm has accumulated about the 
corresponding symbol based on the measurements on the 
subgraph traversed so far, assuming it is a tree. Let px(x) 
denote the a priori probability density function of X. In the 
t-th iteration, we compute 



V k \{x)<x Px {x) J] Ull\a 
for all (fe, jj) with s^k ^ 0, and then 



(25a) 



U, 



MfcO) « / <'-M> 

(xj)dv.\k 



jedn\k 



HsS^kX 

n ^>fe) da! 



(25b) 



where f, , denotes integral over all x 7 with j € du\k, 
and V(a;) oc means that U(a;) is proportional to w(x) 

with proper normalization such that V(x)dx = 1. In 
case X is a discrete random variable, the integral shall be 
replaced by a sum over the alphabet of X. In this problem, X 
follows a mixture of discrete and continuous distributions, so 
the expectation can be decomposed as an integral and a sum. 

The complexity of computing the integral in ( |25b| l is ex- 
ponential in \dfi\ = 0(qN), which is in general infeasible 
for the problem at hand. However, as qN ^> 1, the compu- 
tation carried out at each measurement node admits a good 
approximation by using the central limit theorem. A similar 
technique has been used in the CDMA detection problem, for 
fully-connected bipartite graph in |16|-|18|, and for a graph 



with large node degrees in 1 19 1 



To streamline d25a|l and (|25b|), we introduce tii) 



and 



"kfi 

( a kfi) 2 f° r a ^ (/-*' fc) pairs with s^ k ^ to represent the mean 
and variance of a random variable with distribution V k „(x). 
Using Gaussian approximation, one can reduce the message- 
passing algorithm to iteratively computing the following mes- 
sages: 



l kfj. 



EiX 



Y 



V 7 l ~ x t-i 

2-,v£dk\n ^vk , T 



\dk\ - 1 ' \dk\ - 1 



(a*) 2 = w{X 



Y = 



y t—\ , 



\dk\ - 1 ' \dk\ - 1 



(26a) 



(26b) 



1 



c fik 



/IsS^k 



j€dfj,\k 

j^W? |a,i| J £ ( ^ )2+ ^ M - ?(1 - 9) 

(26d) 



where ( |26a| i and ( |26b[ ) calculate the conditional expectation 
and variance, respectively. The detailed derivation is relegated 
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to Appendix [C] At the T-th iteration, the approximated pos- 
terior mean of Xk can be expressed as 



EiX 



Y 



jr-i 



edk ' 



r-T-1 



\dk\ - 1 ' \dk\ - 1 



(27) 



It is time consuming to compute ( |26a| i and ( |26b| > for all 
(/x, fc) pairs with s^k 7^ 0, especially in the case of large matrix 
S. We can use the following two approximation techniques 
to further decrease the computational complexity. First, \dk\ 
in ( ]26a) , ( |26b| i and ( |27] i is replaced by its mean value 
M a q(l — q). Second, we use interpolation and extrapolation 
to further reduce the computation complexity of ( |26a) , \2&o\ 
and p7) . Specifically, in each iteration t, we only compute the 
conditional mean and variance for some chosen y's, i.e., we 
choose y\ < y\ < • • • < which is a partition of an interval 
depending on r t_1 , compute 



6* = var 



9) 



1 



y = y) 



(28) 



(29) 



T 

M s q(l - 

M s9 (l - 9) - 1 

1,2, ...,ri, and then use those values to calcu- 
( |26b| > by interpolation or extrapolation. To be more 
precise, for any pair //, fc with s M /j 7^ 0, suppose and 

are chosen to be the closest to y = — 



for j = 
late d26ab. 



and (<TjL) can be approximated by 



|9fc|-l 



then "*fc M 



fe/J 



(30) 



(31) 



We now revisit the assumption that X has independent 
elements. In fact, X consists of K sub-vectors of length 2 l , 
where the entries of each sub-vector are all zero except for 
one position corresponding to the transmitted message. After 
obtaining the approximated posterior mean m& by incorporat- 
ing both real and imaginary parts calculated from ( |27) , Algo- 
rithm [T] outputs the position of the element with the largest 
magnitude in each of the K sub-vectors of [mi, . . . , mjy]. In 
fact the factor graph Fig. [T] can be modified to include K 
additional nodes, each of which puts a constraint on one sub- 
vector. Slight improvement over Algorithm [T] can be obtained 
by carrying out message passing on the modified graph. 

VI. Numerical Results 

In order for a fair comparison, we assume the same power 
constraint for both the sparse recovery scheme and random- 
access schemes, i.e., the average transmit power in each active 
slot (in which the node transmits energy) is the same. Recall 
that, in the proposed scheme, where the frame length is M s , 
the SNR in the model ( fTS) is 7 S = jM s q(l— q). We choose the 
same transmission probability in each slot for sparse recovery 
scheme and slotted ALOHA, i.e., q = p = l/(c+l). Also, the 
transmission probability in each slot for CSMA is (1 — e~ c ) jc 




-ALOHA M =1800 

a 

. CSMA M =950 



(see Appendix III-Bi, which is close to l/(c + 1) when c 



Fig. 2. Low bounds for error probability in slotted-ALOHA and CSMA for 
different threshold 8 in the case of I = 10. 



is large. The three schemes consume approximately the same 
amount of average power over any period of time. 

Without loss of generality, let one unit of distance be 1 
meter. Consider a wireless network of 1000 nodes uniformly 
distributed in a square with side length 500 meters. The nodes 
form a Poisson point process in the square conditioned on 
the node population. Suppose the path-loss exponent a = 4. 
The threshold of channel gain to define neighborhood is set 
to 8 = 10~ 6 . It means that if the transmit power for a node 
one meter away is 60 dB, then the SNR attenuates to dB 
at = 10 6 / Q fa 31 meters in the absence of fading, i.e., the 
coverage of the neighborhood of a node is typically a circle 
of radius 31 meters. According to ([8]), a node near the center 
of the square (without boundary effect) has on average c~ 11 
neighbors. 

We consider two cases for the length of broadcast message 
I = 5 and 10 bits. In random access schemes, a packet of 
L bits consists of /-bit message and [log 2 c] additional bits 
to identify the sender. Fig. [2] shows that T = 3.5 minimizes 
the lower bounds for P5; in Jl2\ and in < fT3] > in the case of 
I = 10. 

The metric for performance comparison is the probability 
for one node to miss one specific neighbor, averaged over 
all pairs of neighboring nodes in the network. Suppose the 
transmit power of each node is 7 = 60 dB. First consider 
one realization of the network where each node has c k 11 
neighbors on average and I = 5 bits to broadcast, so that 
on average cl ~ 55 bits are to be collected by each node. In 
slotted ALOHA, at least 4 additional bits are needed to identify 
a sender out of 1000 nodes, so we let L = 9. In Fig. [5] the 
error performance of slotted ALOHA and CSMA for 6 = 0.5 
is compared with that of the sparse recovery scheme with 
the message-passing algorithm. The simulation result shows 
the sparse recovery scheme significantly outperforms slotted 
ALOHA and CSMA, even compared with the minimum of 
the lower bounds computed from ( p~2] > and <JT3J for 5 — 3.5. 
For example, to achieve 1% error rate, the sparse recovery 
scheme takes fewer than 300 symbols. Slotted ALOHA and 
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-O- 1 ALOHA with 6=0.5 
-Q- ' CSMA with 6=0.5 
1( Sparse recovery 
via message passing 
-0— ALOHA bound with 5=3.5 
— B — CSMA bound with 5=3.5 




. + - l=5,M =280 
1 s 

-M — l=10,M =450 



62 64 



Number of symbol transmissions 



Y(dB) 



Fig. 3. Performance comparison between sparse recovery and random access. Fig. 5. Performance of sparse recovery scheme in different nominal SNR 



Each node transmits a 5-bit message. 



(7)- 



ALOHA with 5=0.5 
CSMA with 5=0.5 
Sparse recovery 
via message passing 
ALOHA bound with 5=3.5 
CSMA bound with 5=3.5 




Number of symbol transmissions 



Fig. 4. Performance comparison between sparse recovery and random access. 
Each node transmits a 10-bit message. 



CSMA take no less than 800 and 400 symbols according to the 
bounds in ( |T2| i and ( fl5] >, respectively. In fact, slotted ALOHA 
and CSMA with threshold S = 0.5 take more than 2000 
symbols. Similar comparison is observed for several other 
SINR thresholds S around 0.5 and the performance of ALOHA 
and CSMA are not good for S > 1 because the messages from 
weaker neighbors may never be successfully delivered. Some 
additional supporting numerical evidence is, however, omitted 
due to space limitations. 

Fig. [4]repeats the experiment of Fig. [5] with 10-bit messages. 
The sparse recovery scheme has significant gain compared 
with slotted ALOHA and CSMA. For example, to achieve the 
error rate of 1%, the sparse recovery scheme takes about 450 
symbols, whereas slotted ALOHA and CSMA take at least 
1000 and 650 symbols, respectively. 

In Fig. [5] we simulate the same network with different 
nominal SNRs, i.e., 7 varies from 50 dB to 70 dB. In the case 
that each node transmits a 5-bit message, the frame length is 



chosen to be 280 symbols. It can be seen from the figure that 
the probability of error decreases with the increase of SNR. 
The performance is similar when each node transmits a 10-bit 
message and the frame consists of 450 symbols. 

VII. Concluding Remarks 

The idea of using on-off signalling to achieve full-duplex 
communication using half-duplex radios applies to general 
peer-to-peer networks, and is not limited to mutual broadcast 
traffic focused on in this paper. The sparse recovery scheme 
with random signatures are of course most suitable for the 
situation where the broadcast messages consist of a relatively 
small number of bits. If each node has many bits to send, a 
structured code with low decoding complexity is needed for 
the scheme to be practical. This is ongoing research. 

The proposed scheme can also serve as a highly desirable 
sub-layer of any network protocol stack to provide the impor- 
tant function of simultaneous message exchange among peers. 
This sub-layer provides the missing link in many advanced 
resource allocation schemes, where it is often assumed that 
nodes are provided the state and/or demand of their peers. 

Appendix A 
Proof of Proposition!]] 

Let $^ be an independent thinning of $1 with retention 
probability p to represent the transmitting nodes. It is easy to 
see that $^ is an independent marked p.p.p. with intensity Xp. 
Denote 



(32) 



then we have 



= E 




9 





poo 


1 




J — CO 


i + - 

7 



| (33) 
Pi(i)di (34) 



where ( [33] > is derived from |5]l and pj is the pdf of interference 
I. 

According to (12), the Laplace transform of pj can be 
expressed as 



C PI (s) — exp < —Xps 



2ir 2 



a sin(27r/a) 



(35) 



Therefore, the Fourier transform^] of pi can be obtained by 
replacing s in ( |35| ) by iw with t = as 



= exp \ -\p(Lu) 2/a — r 



a sin(27r/a) 

Since the Fourier transform of \x\ a for — 1 < a < is 

2sin(a7r/2)r(a + 1) 

the Fourier transform of 

qi (i) = \i + l\-£ 

7 

for a > 2 can be expressed as 



'a 



w/7 2sin(7r/a)r(l-2/q) 
1 J M 1 " 2 /" 



(36) 



(37) 



(38) 



(39) 



Since the integral in ( |34| i can be viewed as the Fourier 
transform of pi(i)qj(i) at w = 0, it can be calculated as the 
convolution of F Pi {uS) and I qi (uf) at oj = 1201 . Therefore, 
by (|36|l and ([39"]>, we have 





1 

1+ - 


/ — CO 


7 



Pi(*)di = 2-^1(^*^(0;) 



(40) 



where * is the convolution operator. Therefore, according 
to ( fTTj ), ( f3"4"] > and ( |40] >, the error probability Pjj can be lower 
bounded as 



1 



P e a > max<0,l— p(l-p) 



7" px (w) * J" 9J (w) 



(41) 



j=0 



According to |9|, the number of frames in a period of M a 
symbol intervals should satisfy 



M f > M a log 2 (1 + S)/ L . 



(42) 



Therefore, ( fL?) in Proposition [T] follows by combining ( |4"Tj ) 
and d42l. 



3 The reasons to work with Fourier transform in lieu of Laplace transform 
are: 1) The inverse Fourier transform here is easier to calculate; 2) the Fourier 
transform of \i + — | _ 5 has a closed form. 



Appendix B 
Proof of Proposition^ 

Denote Gi as the fading coefficients node X.- L G $1 to node 
Xq. Define the following indicators for node X 

F 1 =1(T >T) (43) 

F 2 = 1 (Ti > T, VX t e $1 with Gi\Xi - X\~ a > £?) (44) 

F 3 = l( 7 G\X\- a >6) (45) 

where i*i = 1 if and only if the timer of X expires before 
that of Xo, F2 = 1 if and only if X's timer expires sooner 
than those of all its neighbors excluding Xq, F3 = 1 if and 
only if the received SNR from node X to node X exceeds 
the threshold i5. In order for the transmission to be successful, 
we must have Fi = F2 = -F3 = 1. That is 

E |Pc(^> < E {FiF 2 F 3 } . (46) 

Conditioned on T = q, we express the indicator as the 
value of some extremal shot-noise |T2| Section 2.4]. For fixed 

define the indicator of the event that Xi is a neighbor of X 
and it has a timer smaller than <;: 

L(X,Xi,Gi,Ti) =l(Gi\Xi-X\- a >6 and T, < ?) (47) 

for all (XijQijTi) £ $1. Define the extremal shot-noise at 
node X as 

Z i (X)= max L(X,X h G h Ti). (48) 

Note that (X) takes only two values or 1 and conse- 
quently 

E{F 2 \T = ^ =p{%(X)<o|t = ^}. (49) 
By JT2} Proposition 2.4.2], ( |49] i can be further calculated as 
e{f 2 |t = <;} 

= exp ( - A f f [ 1 (L(X, x, g, t) = l) e- g dtdgdx\ 

L JR 2 Jo JO J 

= exp j-27rAc;^ ^ 1 (gr~ a > 6) re~ 9 d 5 dr| 

= e~ c? (50) 

where c is the average number of neighbors defined in ([8). 
Therefore, according to (j46), we have 



{p*(x,s,<li)} 



< 



(1 - c:)e- cs d^ 



1 



c-1 



(51) 
(52) 



where ( |5"Tj ) is derived from the the uniform distribution of 
T , ((5]) and g9j. 

According to {9), the number of frames in a period of M c 
symbol intervals should satisfy 

M f >M c log 2 (l + S)/L. (53) 

Therefore, Proposition [2] follows by combining ( |52"] i and ( [53) . 
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As a by-product, by averaging over <; in ( p0| >, which is 
uniformly distributed on [0,1], the probability that a given 
node captures the channel to transmit in each slot can be 
calculated as (1 — e _c )/c. 

Appendix C 
Derivation of Message Computation ( p6) > 

We derive ( |26] i from ( |25j ). Denote A M fc = Yljed/Ak s M x r 
The key to the simplification is to recognize that A M fc is 
approximately Gaussian. To be precise, if {Xjjjga/Afc were 
independent (conditioned on the observations traversed so far 
on the graph), then, by central limit theorem, Auk converges 
weakly to a Gaussian random variable, whose mean is 



and variance is 



2J s M m m 
jedfi\k 

E 4M,f 



(54) 



(55) 



jedfjt\k 

Using the preceding Gaussian approximation, ( |25b[ ) can be 
calculated by a change of probability measure as 



UL(x) oc 



exp 
1 



exp 



cx exp 



2r. 



fik 



where z^ fe is defined in ( |26c| > and 



' fik 



= E 

jedfi\k 



t \2 
fi) 



2 7s s 2 



dA 



(56) 



(57) 



fik 



Using law of large numbers, we further approximate t k by its 
average over all (fi, k) pairs with s^k ^ 0, i.e., r* fc is replaced 

by 



i 



EE E W 



i 



fe p.ek j£dfi\k 



2 ls s 2 



E 1^1 E (^>) 2 + - g) (58) 



as shown in ( |26d] >. 

Now we have J7*j. ~ N(z t ^ ikl T t ), so it is easy to see that 



n 



• jV 



- 1 ' |5fc| - 1 



(59) 



According to (|25a[), (x) can be viewed as the conditional 



^"\dk\-i'' k J ' where F is the 0Ut P ut 
of a Gaussian channel with noise ~ A/" ^0, ig^j^ 
Therefore, by definition, m^ 1 and (c^ 1 ) 2 can be expressed 



distribution pxiy x 



as the conditional mean and conditional variance as in ( |26a[ ) 
and |26b|. 
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